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ABSTRACT 

Type MB restriction-modification systems, such as 
Bcgl, feature a single protein with both endonucle- 
ase and methyltransf erase activities. Type MB nucle- 
ases require two recognition sites and cut both 
strands on both sides of their unmodified sites. 
Bcgl cuts all eight target phosphodiester bonds 
before dissociation. The Bcgl protein contains 
A and B polypeptides in a 2:1 ratio: A has one cata- 
lytic centre for each activity; B recognizes the DNA. 
We show here that Bcgl is organized as A 2 B 
protomers, with B at its centre, but that these 
protomers self-associate to assemblies containing 
several A 2 B units. Moreover, like the well known 
Fokl nuclease, Bcgl bound to its site has to recruit 
additional protomers before it can cut DNA. 
DNA-bound Bcgl can alternatively be activated by 
excess A subunits, much like the activation of Fokl 
by its catalytic domain. Eight A subunits, each with 
one centre for nuclease activity, are presumably 
needed to cut the eight bonds cleaved by Bcgl. Its 
nuclease reaction may thus involve two A 2 B units, 
each bound to a recognition site, with two more A 2 B 
units bridging the complexes by protein-protein 
interactions between the nuclease domains. 

INTRODUCTION 

Restriction-Modification (RM) systems use two enzyme 
activities to defend bacteria against foreign DNA, a re- 
striction endonuclease (REase) and a modification 
methyltransferase (MTase) (1,2). The MTase transfers a 



methyl group from S-adenosylmethionine (SAM) to a 
cytosine or an adenine within a particular DNA 
sequence, the recognition site for that system, while the 
REase cleaves DNA with unmethylated sites (3). The 
majority of RM systems fall into either the Type I or 
Type II categories (4). Most Type I systems are oligomeric 
(R 2 M 2 S) proteins containing subunits for DNA cleavage 
(R), methylation (M) and sequence specificity (S) (2,5,6), 
though in some cases the R, M and S functions are all 
carried as domains within a single polypeptide (7). The 
cleavage activity is triggered by unmodified sites, though 
it often occurs distant from the site, due to the R subunits 
possessing an ATP-dependent DNA translocase. The 
main reaction of the MTase is to convert hemi-methylated 
sites, methylated in one strand only, to fully modified 
products methylated in both strands. Neither the fully 
nor the hemi-methylated DNA is cut by the REase. 
Consequently, DNA fully modified once by the MTase 
is never cleaved by the REase even after its 
semi-conservative replication. 

Most Type II RM systems use separate proteins for 
restriction and modification that both bind to the same 
DNA sequence (1,8,9). The REase is commonly a 
homodimer that recognizes a palindromic sequence and 
cuts both strands at specified loci within the site: one 
subunit attacks the target phosphodiester bond in one 
strand, while the other cuts the scissile bond in the 
opposite strand (9,10). The MTase is usually a monomer 
whose main role is to transfer one methyl group onto 
DNA already methylated in one strand (8). But many 
Type II systems deviate considerably from these classical 
systems. They have been categorized into numerous 
subtypes — IIG, HE, IIF, IIS and IIB, among others — on 
the basis of their genetic organization, their mode of 
action and their recognition sites and cleavage loci (4). 
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Perhaps the only aspect common to all Type II REases, 
and the one that differentiates them from Type I systems, 
is that they cleave DNA at fixed positions relative to their 
recognition sites. 

Variations in genetic organization include the Type IIG 
systems that, instead of separate REase and MTase 
proteins, possess a single polypeptide with both activities 
(11-14). The IIG systems require SAM not only as a 
co-factor for their methylation reactions but also to 
activate the nuclease. 

Variations in mode of action are illustrated by the Type 
HE and IIF systems (15,16). While the orthodox enzymes 
act at individual sites, the HE and IIF REases need two 
copies of their target sequence for full activity. The HE 
enzymes, often dimers, proceed to cut both DNA strands 
at one of the two sites; the other site acts as an allosteric 
activator (17,18). In contrast, the Type IIF enzymes tend 
to cut both strands at both sites before leaving the DNA 
(18,19): most operate as tetramers of identical subunits, 
with the four active sites, one in each subunit, each pos- 
itioned to cleave one of the four target bonds (20-22). 
Enzymes that require two DNA sites can be identified 
by comparing their activities on DNA substrates with 
one or two cognate sites (19,22-24). Proteins that 
interact with two sites prefer sites in cis, where they loop 
out the intervening DNA, over sites in trans, where they 
must hold together two separate DNA molecules (16). 
Consequently, both Type HE and IIF REases can cleave 
two-site substrates more rapidly than a one-site DNA. 
However, HE enzymes cut two-site substrates one site at 
a time, whereas IIF enzymes cut both sites together (18). 

Variations in recognition and cleavage loci are shown 
by the Type IIS REases, which bind asymmetric rather 
than palindromic sites and which cut the DNA at fixed 
loci away from the sequences, on one side of it (25,26). For 
example, Fokl recognizes 

5' - GGATG - (N) 9 4. -3' 

3' - CCTAC - (N) 13 t -5' 

and cuts both strands at the positions indicated (^,tX 
9 and 13nt away (3). An asymmetric sequence cannot be 
recognized in the same way as a dimeric enzyme 
recognizing a palindromic site, where each subunit 
contacts one half of the site (9,10). Instead, either a mono- 
meric protein contacts the entire site, as is the case with 
Fokl (27), or different subunits in an oligomeric protein 
each contact discrete parts of the sequence, as with BbvCI 
(28). If the nuclease is a monomer, it may have only one 
active site (26), yet it needs to cut both 5—3' and 3—5' 
strands. The use of the same active site to cut both 
strands is often — but not always — prohibited, as it 
requires the site to rotate not only around the DNA, 
from one strand to the other, but also perpendicular to 
the DNA, to match their antiparallel nature (29). To cut 
both strands, the monomer of Fokl bound to the DNA 
associates with a second monomer (30-32) to give a 
dimeric unit with two active sites (33). The second active 
site can alternatively come from the isolated catalytic 
domain of the Fokl protein, without its DNA recognition 
domain (30). The two catalytic centres each cut one 



specified strand: the DNA-bound monomer, the bottom 
strand 13nt away and the recruited unit, the top strand 
9nt distant (34). However, the enzyme at the site associ- 
ates more readily with another monomer bound elsewhere 
in the same DNA than with one in free solution, as 
proteins in cis cannot escape from each other (31,35). 
Fokl thus attacks two-site substrates more rapidly than 
one-site DNA, although the two-site substrate is initially 
cut at just one site: the residual site is then cleaved at the 
same slow rate as the one-site DNA (23,31). The need for 
two recognition sites is common among the Type IIS 
REases: some, like Fokl, act as HE enzymes; others, like 
the tetramer BspMI, as IIF enzymes (36). 

Another group that cleave outside their recognition sites 
are the Type IIB systems (37). Unlike the IIS enzymes that 
cut the DNA on one side of the site, the nucleases from the 
Type IIB systems make two double-strand breaks (DSBs) 
at each site, one either side, to excise a short fragment that 
carries the recognition sequence. Moreover, their recogni- 
tion sequences usually consist of two specified but diver- 
gent blocks of sequence separated by a non-specific 
segment of fixed length, like many Type I sites (2,5). 
For example, Bcgl, an archetype of the IIB enzymes 
(12,38^10), recognizes the sequence, 

5'- | 10 (N) - CGA - (N) 6 - TGC - (N) 12 1 -3' 

3'- tn (N) - GCT - (N) 6 - ACQ - (N) 10 f -5' 

Its REase cuts both strands on both sides of this sequence, 
10 and 12 nt away, to leave the site on a fragment 34 nt long 
in both strands (noted below as the 34-mer): its MTase 
targets the adenines marked above in italics. However, 
the Bcgl nuclease has no activity when bound to a 
solitary site (41). Instead, like most Type IIB REases 
(24), it needs two sites (40), ideally in cis but alternatively 
in trans. After bridging the sites, it then proceeds to make 
two DSBs at both sites before releasing the final products, 
thus cutting a total of eight phosphodiester bonds within a 
single complex: hardly any of the partially cleaved inter- 
mediates accumulate during the course of the reaction (41). 

Bcgl, like many but not all Type IIB systems (37), is 
composed of two subunits: an A subunit of 71.6 kDa that 
carries both MTase and REase catalytic functions; a B 
subunit of 39.2 kDa that recognizes the target sequence 
(12,39). Like the Type IIG systems that have both 
activities in the same subunit, Bcgl requires SAM not 
only for its MTase but also for its REase function: the 
nuclease also needs Mg 2+ (38). Densitometry of 
coomassie-stained SDS gels yielded a 2:1 ratio of A:B 
subunits (12), albeit with the assumption — not always 
valid (42) — that the two polypeptides stain to the same 
extent. However, size exclusion chromatography of 
native Bcgl gave apparent MW values of 250-300 kDa 
(39,43), larger than the 182 kDa expected for a protein 
with one B and two A subunits. Nevertheless, the size 
exclusion MW app can be related to an A 2 B protomer if 
that unit has a large frictional ratio or if it self-associates 
(44). 

The A subunit of Bcgl resembles a fusion of R and M 
subunits from a Type I system without the ATP motor 
functions of R, while B is strikingly similar to an 
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S subunit: it likewise possesses separate domains for 
recognizing each section of its bipartite target site (39). 
An A 2 B assembly for Bcgl is thus a direct analogue of 
the R 2 M 2 S organization of Type I systems. Other Type 
IIB systems contain a single subunit with all three func- 
tions (13,45), akin to the single-polypeptide Type I 
enzymes (7). But while Type I REases generally make 
one DSB, usually about halfway between two recognition 
sites, Bcgl makes four DSBs per reaction (41). How the 
Bcgl RM protein is organized to hydrolyse eight phospho- 
diester bonds in a complex with two recognition sites is 
addressed here. In the following article (46), we show that 
the Bcgl MTase transfers to an individual site in the DNA 
just one methyl group at a time. Hence, this protein may 
be organized differently for its REase and MTase 
activities. 



MATERIALS AND METHODS 

DNA 

The plasmid pARG3-BcgI (a gift from New England 
Biolabs) over-expresses the native forms of bcgIA and 
bcgIB to yield the wild-type (WT) Bcgl protein. The 
plasmid was subjected to site-directed mutagenesis 
(QuikChange II: Agilent) to replace Glu53 in the A 
subunit with Ala, to give pARG3-BcgIE53A [the E53A 
mutant has no nuclease activity (39)]. To construct the 
plasmid pAJJl, the section of pARG3-BcgI encompassing 
bcgIA was amplified by PCR and the resultant fragment 
cloned between the Ncol and Xhol sites of pET28a 
(Merck). 

The supercoiled (SC) forms of the plasmids pUC19 and 
pDG5 were 3 H-labelled and purified as described previ- 
ously (19,24). The following duplexes, all with HEX 
(hexachlorofluorescein) at the 5'-end of the top strand, 
were as before (41): HEX-42S, a 42 bp specific DNA 
with the recognition sequence for Bcgl and both 
upstream and downstream cleavage loci; HEX-42NS, a 
42 bp non-specific DNA that is identical to HEX-42S 
apart from single bp changes in each segment of the rec- 
ognition sequence. HEX-24S and HEX-24NS are 
truncated versions containing the central 24 bp of 
HEX-42S and HEX-42NS respectively: HEX-24S thus 
has the recognition sequence but not the cleavage loci. 

Proteins 

Transformants of Escherichia coli XLl-Blue MFR with 
either pARG3-BcgI or pARG3-BcgIE53A, or E. coli 
Rosetta 2 with pAJJl, were grown in 20 L L-broth with 
the appropriate antibiotics at 37°C in a Applikon 
Biotechnology bioreactor to an OD 60 o of 0.5-0.7. IPTG 
was added to 1 mM and the culture continued for 3^1 h 
before it was passed through a GEA Westphalia separator 
prior to cell harvesting by centrifugation. WT Bcgl, the 
E53A mutant and the A protein were all purified by 
essentially the same procedure as described before for 
the WT protein (41), except that the initial extract — after 
cell disruption and the removal of the debris — was 
fractionated, first with polyethyleneimine and then 
with ammonium sulphate (47), prior to the column 



chromatography steps. As neither BcgIE53A nor the A 
subunit alone possess nuclease activity, the purification 
of these proteins was monitored by SDS-PAGE: the 
final preparations of all three proteins contained no de- 
tectable impurities (48). The identities of the purified 
proteins were confirmed by N-terminal amino-acid 
sequencing (W. Mawby, personal communication). 
Approximately 50% of the A subunit carried methionine 
at its N-terminus, as specified by its gene sequence, and 
50% valine, the second residue in the sequence: the latter 
presumably arises from post-synthetic processing. Protein 
concentrations were assessed from A 2 so measurements 
using molar extinction coefficients calculated from 
amino acid compositions. Molarities of WT Bcgl and 
E53A refer to the A 2 B assembly and that for the A 
protein to its monomeric state. Molecular masses encom- 
pass four possible states of the A protein (46): with and 
without the N-terminal methionine, and with or without 
bound SAM. 

Native mass spectrometry 

Aliquots of BcgIE53A were dialysed overnight against 
200 mM AmAc (ammonium acetate), 10% glycerol, pH 
8.4, and flash-frozen. When required, the samples were 
thawed on ice and buffer-exchanged twice with 200 mM 
AmAc using Micro Bio-Spin P6 columns (Bio-Rad) before 
being diluted in AmAc to the requisite concentration. (In 
this buffer, E53A still bound DNA specifically and the 
WT enzyme cleaved DNA when MgCl 2 was added: data 
not shown.) 

The protein sample was loaded into the mass spectrom- 
eter using nano-electrospray ionization (nano-ESI) in 
positive ion mode with an automated Advion Triversa 
inlet system (5um nozzle diameter, spray voltage 
1.75 kV, gas pressure 0.25 psi). Mass to charge values 
were determined using a Waters Synapt 1 T-wave ion 
mobility quadrupole/time-of-flight tandem mass spec- 
trometer with an 8 k quadrupole for medium-to-high 
m/z transmission. Experimental parameters were chosen 
to effect desolvation of high m/z sample ions while main- 
taining conditions that promote the transmission of intact 
protein complexes, with minimal disruption of native con- 
formations (49): backing pressure, 5.4 mbar; source tem- 
perature, 25° C; sampling and extraction cone, 180 and 
0 V, respectively; trap and transfer collision energies, 60 
and 40 V; trap DC bias, 80 V. Data analysis was con- 
ducted using MassLynx 4.1 software (Waters) without 
background subtraction using smoothing by the 
Savitzky-Golay method (±50 channels). 

Analytical ultracentrifugation 

Sedimentation equilibrium experiments were carried out 
at 20° C in six-channel centrepieces for a Beckman XL- A 
ultracentrifuge. Sample chambers contained the protein — 
either WT Bcgl, E53A or A protein — in HOul analytical 
ultracentrifugation (AUC) buffer (20 mM Tris-HCl, pH 
8.4, 66 mM NaCl, 2mM CaCl 2 , 20 uM SAM and 10% 
glycerol) and reference chambers 120 ul AUC buffer. 
During centrifugation, absorbancies were recorded along 
the centrifugal radius at 230, 280 or 292 nm, depending on 
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the protein concentration. An initial scan was made 
directly on reaching 3000 rpm and the horizontal trace 
used to determine the protein concentration in the cell. 
Further scans were made after >16h at speeds between 
6000 and 15 000 rpm: at least two scans, >4h apart, were 
taken at each speed. A final scan was recorded after 6h at 
40 000 rpm, to yield from the meniscus a baseline absorb- 
ance (B). Data sets were fitted in ORIGIN to Equation 1 
for the distribution of a single ideal species after sedimen- 
tation to equilibrium: 

A r = A*.J^-wX 2 -'®\+B (1) 

where A r and A ro are the absorbance at radius r and at the 
reference r 0 , m the centrifugal speed, v the partial specific 
volume and p the solvent density: when applied to a single 
species, M denotes the molecular mass of that species but 
when applied to a mixture of species of varied mass but 
uniform v, M denotes the mean weight-average MW (M u .) 
of all of the species in the cell (44,50). Multiple species 
were considered to be present whenever the residuals 
between the best fit to Equation 1 varied systematically 
from the experimental data. Further AUC experiments 
were carried out as above, but monitored at 539 nm, on 
samples containing either the HEX-42S or the HEX-42NS 
duplex and excess Bcgl. 

Reactions 

DNA cleavage rates were generally measured by adding 
the requisite concentration of Bcgl protein to 200 ul 
3 H-labelled DNA (5 nM) in buffer R (AUC buffer supple- 
mented with 1 mM DTT and 200 ug/ml BSA, and with 
10 mM MgCl 2 in place of the CaCl 2 ) at 37°C: the DNA 
was the SC form of either pUC19 or pDG5, which carry, 
respectively, one or two Bcgl sites (24,41). Samples (10 ul) 
were taken before and at varied times after adding the 
enzyme, and mixed immediately with an EDTA quench 
prior to electrophoresis through agarose to separate the 
following DNA species: the intact SC substrate; the nicked 
open circle (OC) form; the linear (LIN) species with at 
least one DSB at one site and, for the two-site substrate, 
the final products with at least one DSB at both sites (LI 
and L2). The concentration of each form was evaluated by 
scintillation counting. The reactions at each Bcgl concen- 
tration were repeated at least three times, although occa- 
sionally with different sampling times for each test. For 
repeats with the same time points, the data at each time 
were averaged and the averages fitted to a single exponen- 
tial decay with offset. For reactions sampled at different 
times, the decrease in SC DNA in each reaction was fitted 
to an exponential and the apparent rate constants then 
averaged. 

Reactions that were too fast to measure by manual 
sampling were carried out by mixing, in a Hi-Tech 
RQF-63 quench-How apparatus (TgK Scientific), equal 
volumes (80 ul) of 3 H-labelled pDG5 (10 nM) and Bcgl 
(at twice the required concentration), both in buffer R at 
37°C. After the requisite delay time, the reaction was 
mixed with 80 ul EDTA (500 mM). Samples (100 ul) of 
the quenched reaction were then analysed by 



electrophoresis through agarose as above, to yield the 
residual concentration of the SC DNA. 

For Bcgl reactions in the presence of additional A 
protein, plasmid cleavage and the release of the 34-mer 
were measured in parallel (41). The reactions, in 200 ul 
buffer R* (as Buffer R but 5% [w/v] Ficoll 400 instead 
of the glycerol) at 37°C, contained 5nM 3 H-labelled 
pUC19 and specified concentrations of both native Bcgl 
and BcgIA proteins. Samples (20 ul) were taken from the 
reactions at various times, mixed immediately with quench 
supplemented with 250 ug/ml proteinase K and incubated 
at 37°C for >30min. Half of each sample was analysed by 
electrophoresis through agarose, as above, to evaluate the 
concentrations of the DNA species that could be resolved 
on agarose (the SC, OC and LIN forms of the plasmid). 
The other half was loaded onto a native 20% acrylamide 
gel, alongside an equivalent amount (5nM) of Hhal- 
digested pUC19. The acrylamide gel was stained with 
SYBR Safe (Invitrogen) for 20min, washed and its fluor- 
escence recorded in a Typhoon Phosphorlmager 
(Molecular Dynamics) using its 488 nm laser and 520 BP 
40 emission filter. Images were analysed using 
ImageQuant software. The concentration of the 34-mer 
was determined relative to the known concentration of 
the 28 and 33 bp fragments from the Hhal-digested 
pUC19 control. 

DNA binding 

Tests to measure DNA binding by Bcgl were carried out 
at 37°C in 10 ul buffer R*C (as Buffer R* but with 2mM 
CaCL instead of the MgCl 2 ) containing a HEX-labelled 
oligoduplex (20 nM) and varied concentrations of native 
Bcgl. After adding the protein, samples were incubated 
for 5min and then loaded directly onto 6% acrylamide 
gels in TBC (as TBE but with 2mM CaCl 2 in place of 
the EDTA) running at 4V/cm. After loading, the 
voltage was increased to 12.5 V/cm. Gels were imaged in 
their plates in the Phosphorlmager using its 532 nm laser 
and 555 BP 20 emission filter. Images were analysed in 
ImageQuant and the concentration of free DNA in each 
lane assessed relative to that in the lane without Bcgl. The 
background was taken at an equivalent position in a lane 
lacking DNA. 

RESULTS 

Glu53 in the A subunit of the Bcgl RM protein forms part 
of the PD. . . EXK motif common to many nuclease active 
sites (9) and its replacement with Ala abolishes the 
nuclease activity of Bcgl (39). Hence, to enable studies 
of Bcgl without its cleavage reaction, an over-producing 
strain for the E53A mutant of BcgIA was constructed (see 
'Materials and Methods' section). Preparations of the 
E53A mutant from this strain yielded about twice the 
amount of protein as parallel preparations of the active 
WT enzyme (48). Over-expression of the WT nuclease may 
result in cell death due to cleavage of the chromosome at 
non-cognate sites, as seen with other restriction enzymes 
(51,52), although the Bcgl RM protein carries the MTase 
that ought to protect the recognition sites on the cellular 



Nucleic Acids Research, 2013, Vol. 41, No. 1 395 



DNA (12,39). On the other hand, cells over-producing the 
inactive mutant might remain viable and so generate more 
of the protein. Experiments requiring high concentrations 
of the Bcgl protein were thus facilitated by using the E53A 
protein rather than WT Bcgl. 

Mass spectrometry 

Native mass spectrometry (MS) was carried out primarily 
on the E53A mutant, although equivalent spectra were 
obtained with WT Bcgl albeit with lower signal-to-noise 
ratios (data not shown). Under conditions where non- 
covalent interactions are preserved, the spectrum in 
Figure 1 shows a series of peaks, which are assigned to 
the A and B subunits and their complexes with different 
stoichiometries. The predominant species (~75% of the 
total intensity) is the A 2 B complex with an m/z of 
around 7000, with principal charge states of 28+ and 
29+. The mass calculated for the A 2 B complex is ca. 
182.4 kDa, as averaged over the four different species 
(±N-terminal methionine and ± bound SAM; see 
above), which are not resolved here. The major peak in 
Figure 1 yields an experimental mass of 182 434, which 
matches closely the A 2 B form of Bcgl. In addition, 
further multiples of A 2 B assemblies were observed up to 
at least four such units, an (A 2 B) 4 complex at around m/z 
13 000 (Figure 1, insert). The relative ratios of these 
higher-order complexes in the sample cannot be directly 
measured from the spectrum, as the sensitivity of the ion 
detector decreases with increasing m/z in a non-linear way. 
The time-of-flight system has in theory no upper limit on 
the size of molecule it can detect but the transmission and 



the detection of ions depends on their mass. It is therefore 
difficult to determine the relative amounts of each species 
in the solution. 

In addition to the presence of these larger species, some 
separated components of the complexes were also 
observed. The masses of these smaller species, up to m/z 
6000, correspond to the BcgIA and BcgIB subunits by 
themselves and to an AB assembly (Figure 1). These 
species are likely to be subunits that had dissociated 
from the Bcgl endonuclease in solution, as suggested by 
the low charge states of these species (gas-phase dissoci- 
ation products would have had disproportionally high 
charge states, similar to denatured proteins). In an A 2 B 
protomer, the BcgIB subunit would normally be expected 
to have the weakest non-covalent linkage to the assembly 
as its mass, and therefore surface area, is less than that of 
the A subunit: this ought to make it easier for BcgIB to 
leave the complex than BcgIA. Hence, the presence of an 
AB species, but not an A 2 dimer, suggests that the two A 
subunits in the active Bcgl protein both interact with B 
but not with each other: the A subunits presumably flank 
the B subunit, one on either side. If this suggestion is 
correct, then the first step in the disassembly of the 
protein should be the dissociation of one BcgIA subunit 
from the A 2 B unit to leave AB, as is seen in the spectra. 

AUC 

Native MS identified several assemblies of the Bcgl 
protein containing multiple copies of the A 2 B protomer 
but it does not provide quantitative information about the 
amount of each species. Moreover, the native MS was 
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Figure 1. Native MS. A nano-ESI mass spectrum of the E53A form of Bcgl (27.5 uM) in 200 mM AmAc was recorded under non-denaturing 
conditions (see 'Materials and Methods' section). The insert shows the signals for the oligomeric states in the m/z range from 8500 to 14 500 
magnified 5-fold. The charge state series assigned to each molecular species (in bold) are indicated with dashed brackets, and the charge and m/z 
value for the primary peak in each series is given. 
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Figure 2. Sedimentation equilibrium. Samples of Bcgl in AUC buffer 
were centrifuged at 7500 rpm at 20°C for the times noted below. The 
main panels show the absorbance (white circles) as a function of the 
centrifugal radius: (a) 0.44 uM Bcgl at 230 nm after 30 h; (b) 18uM 
Bcgl at 292 nm after 44 h. The lines through the data points in the 
main panels are the best fits of the following data sets to Equation 1: 
(a) the records after 26 and 30 h; (b) after 32, 38 and 44 h. The best fits 
were obtained with values for M of 183 and 431 kDa in (a) and (b) 
respectively. The plots above the main panels display the residuals 
between these fits and the experimental data: the random variations 
in (a) show that the record can be accounted for by a single ideal 
species with a unique value of M; the systematic deviations in (b) dem- 
onstrate that more than one species is present, which gives rise to a M w 
value. 



done in a buffer designed to permit volatilization of the 
sample rather than that for optimal enzyme activity. To 
address these issues, AUC experiments were performed on 
WT Bcgl under conditions similar to those used for its 
DNA cleavage assays, though in Ca 2+ rather than Mg 2+ 
so as to allow for DNA binding without cleavage (53). The 
inactive E53A mutant of Bcgl behaved similarly to the 
WT in both the absence and presence of DNA and, in 
this case, with either Ca 2+ or Mg 2+ at hand (data not 
shown). 
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Figure 3. Self-association of Bcgl. Varied concentrations of Bcgl in 
AUC buffer were loaded into the AUC and sedimented to equilibrium 
at 7500 rpm at 20°C. For each sample, the absorbance was measured as 
a function of centrifugal radius and fitted to Equation 1. Apart from 
the data at the lowest Bcgl concentrations (viz. Figure 2a), the residuals 
between the fits and the experimental data displayed non-random de- 
viations (viz. Figure 2b), so the best fit at each initial concentration 
denotes an M„, value. The M w values are plotted against the loading 
concentrations of Bcgl. The values expected for the 182.4 kDa A 2 B 
protomer and for the 365 kDa (A 2 B) 2 dimer are marked with long 
and short dashes, respectively. 



Sedimentation equilibrium experiments were carried out 
with varied concentrations of WT Bcgl. After centrifuga- 
tion for >20h to achieve equilibrium (as evidenced by 
overlaying traces from successive scans >4h apart), the 
radial distribution of the protein was measured by absorb- 
ance at 280 nm for protein concentrations between 1 and 
IOliM; or at 230 or 292 nm for lower and higher concen- 
trations, respectively (Figures 2a and 2b). At each concen- 
tration, the distribution was fitted to Equation 1, for the 
sedimentation of a single ideal species: the fits encom- 
passed several data sets from different times. At the 
lowest concentration of Bcgl tested (0.4 uM), the residuals 
between the experimental data and the best fit to Equation 
1 were both small and random, showing that the data can 
indeed be accounted for by a single species (Figure 2a). 
The AUC data at low Bcgl concentrations yielded a M W 
of 183 kDa, which corresponds closely to the 182.4 kDa 
expected for the A 2 B protomer. In contrast, the fit of the 
data at high concentrations to Equation 1 gave large and 
non-random residuals (Figure 2b), suggesting either the 
presence of multiple species and/or non-ideality (44). In 
this situation, the best fit to Equation 1 does not yield the 
MW of any individual species but rather the mean weight- 
average MW (M„.) of all of the species in the sample 
(44,50). 

Values for M K were obtained at each loading concen- 
tration of Bcgl tested (Figure 3). The M w values increased 
with increasing levels of Bcgl: from that corresponding to 
a single A 2 B unit at low concentrations, then to values 
consistent with dimers of A 2 B and finally to levels 
indicating assemblies with more than two protomers. At 
the highest concentration tested (18 uM: Figure 2b), the 
M„. came to 431 kDa. As these mean values for M„.encom- 
pass the full range of protein concentrations present in the 
centrifuge cell after sedimentation to equilibrium, from 
low concentrations at the inside edge to high at the 
outside, a significant fraction of the protein in this 
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sample must be present in assemblies containing more 
than two protomers. 

AUC was also used to analyse complexes of WT Bcgl 
with either a specific duplex carrying both recognition and 
cleavage sites (HEX-42S) or a non-specific DNA that 
lacked the recognition sequence (HEX-42NS). 
Sedimentation was monitored at 539 nm, to detect the 
HEX moiety on the DNA, either in its free state or 
when enzyme-bound. In the absence of Bcgl, MW 
values of 27.6 and 26.6 kDa were obtained for HEX-42S 
and HEX-42NS, respectively (data not shown), close or 
equal to the expected MW, 26.6 kDa. When the 
non-specific duplex (2uM) was centrifuged in the 
presence of excess Bcgl (7.5 uM), the initial uniform dis- 
tribution of the HEX label throughout the cell (Figure 4a, 
black trace) was converted into the characteristic profiles 
for an approach to sedimentation equilibrium (Figure 4a, 
coloured traces). The records at equilibrium (>22h) 
yielded a M w of 349 kDa, much larger than that for the 
free DNA so all of the DNA must be bound to the 
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Figure 4. AUC of Bcgl-DNA complexes. The samples, in AUC buffer 
at 20°C, contained WT Bcgl protein (7.5 uM) and a DNA duplex 
(2uM) as follows: (a) HEX-42NS, a non-specific DNA that lacks the 
recognition sequence; (b) HEX-42S, a specific DNA with both recog- 
nition and cleavage loci. The samples were centrifuged in the AUC and 
the absorbance across the cell monitored at 539 nm at the following 
stages after initiating the run: the instrument was initially set to spin at 
3000 rpm and an absorbance scan recorded immediately on reaching 
that speed (black trace). The velocity was then raised to 7500 rpm and 
scans taken after 14 h and after further 8h delays. The records shown 
are after 14 h (green trace), 22 h (cyan trace), 38 h (pink trace) and 54 h 
(blue trace). The data sets with the non-specific duplex (a) recorded 
after >30h were fitted to Equation 1: the best fit gave a M„ of 
349 kDa. 



protein. The M w for the non-specific complex is similar 
to that for the same concentration of free protein 
(Figure 3), which indicates that either two or more 
protomers can bind to one duplex or that the DNA- 
protein complexes associate with themselves. Conversely, 
after adding Bcgl (7.5 uM) to the specific duplex (2uM) 
and spinning the sample at low velocity, the solution 
lacked virtually any absorbance at 539 nm (Figure 4b, 
black trace). Subsequent scans after centrifugation at a 
higher speed revealed a progressive recovery in the A 53 , g 
signal (Figure 4b, coloured traces) but even after 54 h, the 
total absorbance in the cell containing specific DNA was 
still lower than that containing non-specific DNA and had 
yet to reach equilibrium. The complex of Bcgl with 
specific DNA must precipitate out of solution as soon as 
it is formed and then slowly redissolves back into solution. 
An 1 8 bp duplex that contains the recognition site but not 
the cleavage loci also precipitated from solution on the 
addition of WT Bcgl, as did the specific 42 bp duplex 
with E53A in the presence of either Ca 2+ or Mg 2+ (data 
not shown). 

DNA cleavage 

During its DNA cleavage reaction, the Bcgl protein inter- 
acts with two copies of its recognition sequence and then 
proceeds to make four DSBs, one either side of each site, 
thus cutting a total of eight phosphodiester bonds in a 
single DNA-protein complex (41). It is highly unlikely 
that one A 2 B protomer can cut all eight bonds because 
it contains only two active sites for phosphodiester hy- 
drolysis, one in each A subunit. Instead, it seems more 
likely that DNA cleavage is carried out by an assembly 
of A 2 B units, maybe like those seen by native MS and 
AUC. This possibility was addressed by analysing how 
the extent of cleavage of plasmids with one or two Bcgl 
sites varied as the protein concentration was raised 
from below to above the level of Bcgl sites in the DNA 
(Figure 5). Bcgl can act with two recognition sites in cis or 
in trans, although, as is usual for interactions bridging two 
loci (16), it prefers sites in cis over those in trans. Two-site 
substrates are thus cleaved more readily than one-site 
DNA (24,40). 

In previous studies (40), the extent of cleavage of 
plasmids with one or two Bcgl sites was measured at 
varied Bcgl concentrations after a fixed time (15min): 
complete cleavage of the DNA within this time required 
a molar excess of protein over DNA, with more protein 
needed for the one-site than for the two-site DNA. As the 
extent of cleavage after a fixed time interval may reflect 
either the end point or the rate of the reaction, reactions at 
varied Bcgl concentrations were monitored in this study 
over extended time bases (Figure 5). On the one-site 
plasmid, no — or little — cleavage was detected at Bcgl 
levels below or equal to that of the DNA, even after pro- 
longed incubation (Figure 5a). Instead, the reaction on the 
one-site substrate required enzyme concentrations in 
excess of the DNA. The extent of cleavage increased as 
the ratio of enzyme to recognition sites was raised above 
1:1 but complete cleavage was only observed with a sub- 
stantial excess of protein over DNA. The plasmid with 
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two cognate sites might have been expected to require 
more enzyme per mole DNA to fill both sites but 
instead almost all of the two-site DNA was cleaved at 
equal levels of enzyme and DNA: i.e. with Bcgl at half 
the concentration of recognition sites (Figure 5b). 

Enzyme concentrations that gave complete cleavage of 
the DNA did not necessarily give maximal reaction rates: 
for instance, the two-site plasmid (present at 5nM) was 
cleaved fully by both 10 and by 20 nM Bcgl protein but 
the latter gave a faster rate (Figure 5b). Cleavage of both 
one- and two-site substrates was therefore studied across a 
range of Bcgl concentrations far above that of the DNA 
(Figure 6). During these reactions, the concentrations of 
the SC DNA substrates fell exponentially to end points 
where essentially all of the DNA had been cleaved. With 
enzyme in excess over the DNA, the rate constants (fe 0 bs) 
from the exponential decline of the SC DNA denote the 
cleavage of at least one phosphodiester bond in the 
enzyme-DNA complex rather than the entire reaction 
ending with the release of the DNA cleaved at eight 
bonds. Phosphodiester hydrolysis by Bcgl bound to the 
one-site substrate took >2min to reach completion and 
could be monitored by manually mixing the reagents 
(Figure 6a). In contrast, Bcgl bound to the two-site 
plasmid could take <0.2min to cut all of the DNA, so 
in these cases the experiments were done in a rapid 
quench-flow device (Figure 6b). At Bcgl concentrations 
that permitted both hand-mixing and quench-flow 
approaches, the two techniques yielded comparable 
results (data not shown). 

The values for k a ^ s (Figures 6a and 6b: other data not 
shown) were plotted against the enzyme concentration 
(Figure 6c). For both one- and two-site plasmids, the 
rates increased with increasing Bcgl concentrations in a 
hyperbolic manner (Figure 6c). The data with each sub- 
strate were fitted to an equation for a rectangular hyper- 
bola to yield values for /c max (the rate constant for DNA 
cleavage at saturation of the DNA with protein) and for 
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Figure 5. DNA cleavage at low [Bcgl]. Reactions in buffer R at 37°C 
contained one of the following plasmids ( 3 H-labelled, initially ~90% 
SC monomer) at 5 nM and Bcgl protein at the concentrations noted 
below. The plasmids were (a) pUC19. which has one Bcgl site; 
(b) pDG5, which has two sites. The enzyme concentrations for each 
reaction are marked on the right: 2 nM, white circles; 5 nM, black 
circles; 10 nM, white squares; 20 nM, black squares. Samples were 
taken at the indicated times and analysed by electrophoresis to 
separate the SC substrate from the cleaved products. The residual con- 
centrations of the SC DNA were determined and are plotted as a 
function of time. The lines are the best fits to single exponential 
decays with floating offsets. 
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Figure 6. DNA cleavage at high [Bcgl]. (a) The reactions, in buffer R at 37°C, contained 5nM 3 H-labelled SC pUC19 and Bcgl protein at 75 nM, 
white circles; 250 nM, black circles; 750 nM, white squares. After manually mixing the reagents, samples were taken from the reactions by pipette and 
mixed immediately with an EDTA quench. The samples were then subjected to electrophoresis through agarose, to enable the concentration of SC 
DNA to be determined. The amount of SC DNA present in each sample is plotted against the time, (b) As (a) except that the DNA was SC pDG5 
and that the reactions were carried out by quench-flow. (c) Apparent rate constants (/r 0 b S ) from >3 independent experiments at each Bcgl concen- 
tration were averaged and plotted against the concentration: for reactions on pUC19, white circles; for reactions on pDG5, black circles. Error bars 
denote standard deviations. The line drawn through each data set is the best fit to a hyperbolic function, fe„b S — (k msix x [BcgI])/(A'i/ 1 + [Bcgl]). 
The best fits were obtained with the following: for pUC19, £ max = 2.7 min -1 and K. h = 232nM; for pDG5, /c mllx = 21 min - and K, A = 230nM. 
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(a) Specific DNA (b) Non-Specific DNA (c) 




Figure 7. DNA binding, (a and b) The mixtures contained, in 10 ul buffer R*C, 20 nM DNA, either HEX-42S (a) or HEX-42NS (b), and Bcgl at 
one of the following levels, from left to right: 0, 20, 40, 60, 80, 100, 150 or 200 nM. After 5min at 37°C, the samples were subjected to electrophoresis 
through polyacrylamide and the fluorescence from each band recorded. On the left of gel images, F marks the position of the free DNA; C, the 
DNA-protein complexes that entered the gel; and W, the bottom of the wells, (c) From titrations of both HEX-42S and HEX-42NS with increasing 
amounts of Bcgl protein, and from additional experiments with 24 bp duplexes (HEX-24S and HEX-24NS: gels not shown), the amount of free DNA 
in each sample was determined relative to the amount in a parallel lane without enzyme. The % DNA in the free form is shown as a function of the 
Bcgl concentration: HEX-42S, white triangle; HEX-42NS, white squares; HEX-24S, black triangles; HEX-24NS, black squares. Mean values from 
three repeats are given: error bars show standard deviations. The red lines through the data with the specific duplexes denote stoichiometric binding 
where all of the added Bcgl is bound to DNA until saturation of the DNA is achieved. 



K Yl (the protein concentration for half maximal rate). The 
two-site plasmid gave nearly a 10-fold higher value for 
/c max than the one-site DNA, 21min _1 compared with 
2.7 min -1 . In contrast, the one- and two-site substrates 
gave the same value for K Vi , 230 nM in both cases. The 
faster rate of cutting the two-site over the one-site plasmid 
is thus due solely to the different /c max values. The DNA 
cleavage step in the Bcgl reaction on sites in cis is thus 
faster than that on sites in trans. 

The values for K Vl merit two comments. Firstly, they 
indicate that remarkably high Bcgl concentrations are 
required for its maximal reaction rate: cleavage of 5 nM 
DNA at 90% of its maximal rate requires an enzyme con- 
centration of >2000nM. Secondly, though proteins that 
bind two DNA sites nearly always have higher affinities 
for sites in cis than sites in trans (16), the equal Ky 2 values 
show that reactions with two sites in trans feature the same 
affinity for Bcgl protein as that during reactions with sites 
in cis. The K Vl values may therefore reflect a weak protein- 
protein association in the assembly of the reaction 
complex rather than the DNA-protein association. 

DNA binding 

An alternative explanation for why DNA cleavage by 
Bcgl requires such large amounts of protein is that the 
enzyme simply has a low affinity for its recognition 
sequence. To test this possibility, the binding of the Bcgl 
protein to specific and non-specific DNA was examined 
directly by gel retardation (Figure 7). The binding studies 
were initially done with two 42 bp oligoduplexes, both 
labelled with HEX at one 5' terminus. One, HEX-42S, is 
a potential substrate, as it carries both the recognition 
sequence and the cleavage loci both sides of the site. The 
other, HEX-42NS, differs only at 2 bp, one in each 
segment of the bipartite site so is effectively a non-specific 
sequence for Bcgl (41). The buffer used here contained 
Ca" + rather than Mg 2+ , to allow WT Bcgl to bind to 



HEX-42S without cutting off its ends. Increasing concen- 
trations of Bcgl were added to a fixed amount of each 
DNA and the free DNA then separated by electrophoresis 
from the protein-bound species with retarded mobilities 
(Figures 7a and 7b). The extent of binding at each 
enzyme concentration was then determined by assessing 
the fluorescence from the band corresponding to the free 
DNA relative to that in samples without enzyme (Figure 
7c). Similar results (not shown) were obtained with the 
inactive E53A mutant with either Ca 2+ or Mg 2+ and 
with no metal present. 

When increasing concentrations of Bcgl were added to 
the specific duplex, HEX-42S, the concentration of the 
free DNA decreased linearly with the protein concentra- 
tion until it had all been converted to DNA-protein 
complexes (Figure 7c). Bcgl thus binds to DNA carrying 
its recognition sequence in a stoichiometric fashion, where 
all of the added Bcgl is bound to the DNA until saturation 
of the DNA is achieved, as opposed to a dynamic equi- 
librium between free and bound proteins. However, satur- 
ation of the specific duplex seems to require 3-4 mole Bcgl 
per mole DNA, as it took ~70 nM Bcgl to saturate 20 nM 
DNA (Figure 7c, intersection of red lines). Bcgl also 
bound to the non-specific DNA, HEX-42NS, although 
with a lower affinity than the specific sequence 
(Figures 7b and 7c). Moreover, the specific and 
non-specific sequences gave rise to different complexes. 
The specific DNA was initially transformed to species 
that failed to enter the polyacrylamide gel and remained 
in the wells (Figure 7a): species that entered the gel were 
observed only after sufficient enzyme had been added to 
convert all of the DNA to complexes. The nature of the 
complexes that remain in the wells and those that enter the 
gel have yet to be established. They may be related to the 
complexes seen in the AUC, where the addition of Bcgl to 
cognate DNA led initially to an insoluble complex which 
then slowly redissolved back into solution (Figure 4b). In 
contrast, all of the complexes formed with the non-specific 
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DNA entered the gel (Figure 7b) but, as with the specific 
DNA, the species within the gel displayed progressively 
reduced mobilities with increasing amounts of Bcgl, 
probably due to the binding of additional molecules of 
protein to each molecule of DNA. 

Further experiments were carried out with 24 bp 
duplexes with or without the recognition sequence, 
HEX-24S and HEX-24NS: the former contains the recog- 
nition sequence but lacks the cleavage loci 10 and 12 nt 
away from the site. Nevertheless, Bcgl bound to both of 
these 24 bp duplexes in the same manners (gels not shown) 
and with the same affinities (Figure 7c) as their 42 bp 
counterparts. The cleavage loci are therefore not needed 
for specific binding to the site. 

A striking feature of the equilibrium binding of Bcgl to 
its cognate sequence is that a much lower concentration of 
the protein is required to saturate binding to the site 
(Figure 7c) compared with the amount needed to 
saturate the DNA cleavage rate (Figure 6c): 70 cf. 
>2000nM. A scheme that can account for this behaviour 
is a mechanism of the type seen with the Fokl restriction 
enzyme (30-32), 



E+E+S 



a 'dna 



i E .S+E^E 2 S^ 



E 2 P 



In this scheme, one molecule of the enzyme (E) first 
binds to its DNA site (S) with an equilibrium dissociation 
constant A^ DNA but the resultant E.S complex has no 
activity until the DNA-bound protein associates with a 
second protein, with a constant K prot for the protein- 
protein assembly: the complex containing two molecules 
of enzyme then proceeds to cleave the DNA at the & max 
rate. In the case of Fokl (31,32), the K~d for the binding of 
the enzyme to its site (4 nM) is much smaller than that for 
the protein-protein association (in the range from 100 to 
166 nM), so the level of Fokl needed for its maximal DNA 
cleavage rate is much higher than that required to fill its 
binding sites on the DNA. The same seems to apply to 
Bcgl, though the stoichiometry of the binding reaction 
(Figure 7c) implies that even its initial E.S complex must 
contain more than one A 2 B unit, and that it then proceeds 
to bind yet more A 2 B units. Indeed, if the K Vl from its 
maximal cleavage rate (230 nM) corresponds to its value 
for ^Tp r ot, then the E.S complexes for Bcgl and for Fokl 
display similar affinities for their additional enzyme. 

Activation by BcgIA 

The monomer of Fokl at its recognition site can be 
activated by associating not only with intact Fokl 
protein but also with just its catalytic domain (30). The 
association of the intact protein and the catalytic domain 
generates a unit with two active sites that can then proceed 
to cut both DNA strands (33). The nuclease function of 
the Bcgl protein resides in its A subunit while the sole 
function of B is DNA recognition (12,39). Hence, it 
might be possible to activate the Bcgl REase at its recog- 
nition site by excess A protein, isolated from the B 
subunit. To test this idea, the bcgIA gene was cloned 
and over-expressed in E. coli cells that lacked bcgIB and 
the A protein purified to homogeneity (see 'Materials 



and Methods' section). In solution, BcgIA was found by 
AUC and native MS to exist primarily as a 71.5 kDa 
monomer (48). The fact that the isolated BcgIA protein 
exists as a monomer concurs with the native MS on the 
native Bcgl protein, which had shown A monomers, AB 
and A 2 B assemblies, but no A 2 dimers (Figure 1). 

When the purified A protein was added by itself to the 
one-site plasmid pUC19, no DNA cleavage was observed 
even after long incubations with high BcgIA concentra- 
tions (Figure 8, red line). BcgIA was then added to pUC19 
together with intact Bcgl protein. However, the latter was 
present at the same concentration (5nM) as the DNA, 
which is too low for it to cleave the DNA unaided 
(Figure 5a). Samples were taken from the mixture and a 
portion of each analysed by electrophoresis through 
agarose to separate the SC, OC and LIN forms of 
pUC19. Though neither protein alone was capable of 
cutting this DNA, together they rapidly converted most 
of the SC plasmid directly to LIN DNA, without 
accumulating nicked OC forms en route (Figure 8). The 
LIN DNA could have been cut in both strands on just one 
side of the Bcgl site, to give a LIN species of 2686 bp, or 
on both sides to release the 34-mer product and a LIN 
DNA of 2652 bp. As these LIN forms are inseparable on 
agarose, a second portion from each sample was applied 
to a polyacrylamide gel to capture the 34-mer (41). The 
increase in the concentration of the 34-mer matched that 
for the LIN DNA (Figure 8). The A protein must there- 
fore enable the intact protein to make DSBs on both sides 
of its site. Hence, both A subunits in the A 2 B protomer at 
the recognition site interact with free BcgIA monomers to 
give an assembly with two pairs of nuclease active sites, 
each of which makes a DSB. If the addition of A to A 2 B 
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Figure 8. Activation of Bcgl by BcgIA. The reactions, in 200 ul buffer 
R* at 37°C, contained lOOnM BcgIA protein and 5nM 3 H-labelled 
pUC19 (initially 90% SC monomer), in either the absence or 
presence of 5 nM WT Bcgl. Aliquots (20 ul) were removed at the 
times shown and immediately quenched before analysis by electrophor- 
esis: one-half through agarose, to separate the SC, OC and LIN forms 
of the plasmid; the other half through polyacrylamide, to capture the 
34-mer. The concentrations of the various forms of pUC19 were 
evaluated from the agarose gel and that for the 34-mer from the poly- 
acrylamide gel (see 'Materials and Methods' section). For the reaction 
with BcgIA alone, only the level of SC DNA is shown: red unfilled 
triangles and line. For the reaction with both BcgIA and native Bcgl, 
the following were assessed: intact SC DNA, white circles; nicked OC 
DNA, white squares; LIN DNA, with at least one DSB at the Bcgl site, 
white inverted triangles; the 34-mer, black triangles. Mean values from 
three repeats are shown: error bars denote standard deviations. 
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had resulted in only one unit for DSBs, it would have first 
cut one side of the site and only later the other side: the 
34-mer would then have been liberated not at the same 
time as the LIN species but instead at some later time. 
Strikingly, extra A subunits also enhance the MTase 
activity of the Bcgl protein (46) but, as the MTase methy- 
lates only one adenine at a time, the activation of the 
MTase almost certainly does not require the catalytic 
functions for methyl transfer in the free A subunit. In 
contrast, the liberation of the 34-mer shows that the 
extra A subunits contribute to the Bcgl REase functional 
active sites for phosphodiester hydrolysis. 

The effect of the A protein was also examined at higher 
concentrations of native Bcgl to see whether or not add- 
itional BcgIA could still enhance the cleavage rate. In re- 
actions containing 25 nM WT Bcgl and 5nM pUC19, all 
of the DNA was cleaved readily (Figure 5a). When these 
reactions were supplemented with lOOnM BcgIA (as in 
Figure 8), no difference was observed in either the extent 
or the rate of the reaction relative to that in the absence of 
the A protein (data not shown). Hence, the additional 
catalytic domains that the REase at the recognition site 
needs for its nuclease reaction can come from either free A 
subunits or excess A 2 B protomers. Nevertheless, the acti- 
vation requires a lower concentration of A 2 B units than 
free A subunits. 



DISCUSSION 

The substrate for the REase activity of the Bcgl RM 
protein (Figure 9a) consists of two copies of its recogni- 
tion sequence (24,40,41). In contrast to the many restric- 
tion enzymes that are most active when bound to two sites 
but which still have some activity when bound to a single 
site (22,54-56), the Bcgl REase has an absolute require- 
ment for two sites. It has no detectable activity when 
bound to a single site on a DNA held in physical isolation 
from any other DNA, but that isolated site is cleaved 
when a second specific DNA is added (41). At each site, 
it cuts both strands at two separate loci, one 10/12 nt 
upstream and the other 12/10 nt downstream (38). After 
binding to its substrate, the cleavage reaction yields none 
of the nicked species cut in one strand and only low levels 
of the intermediates with one DSB. Instead, most of the 
DNA is converted directly to the final products with two 
DSBs at both sites. Hence, after spanning two sites, the 
REase can cut all eight bonds within the lifetime of the 
synaptic complex (41). To achieve this feat, the REase 
activity of Bcgl most likely requires an assembly contain- 
ing eight catalytic centres for phosphodiester hydrolysis. 
Given the highly concerted nature of its reaction, it is 
unlikely that it sequentially moves its catalytic centres 
from bond to bond, especially not from a bond in the 
5—3' strand to one in the 3—5' strand. 

The A subunit of the Bcgl RM system carries both 
REase and MTase functions, while B resembles a DNA 
specificity subunit (HsdS) from a Type I RM system 
(12,39). HsdS subunits recognize bipartite sequences via 
separate domains for each segment, with each domain 
linked to a conserved region that associates with a 
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Figure 9. Scheme for action at eight phosphodiester bonds by Bcgl. 
(a) The substrate for the REase activity of Bcgl consists of two 
copies of its recognition sequence. The sites, shown here aligned in 
parallel, can be located either on the same DNA molecule, in cis, or 
on separate molecules, in trans. The bipartite recognition site is 
indicated by black segments, with an arrowhead to mark its 5—3' orien- 
tation: strand polarities are also indicated at DNA termini. Cleavage 
loci 10 and 12 nt distant from the site are marked with vertical arrows: 
upstream and downstream loci are noted with — or + signs, respect- 
ively, (b) The two sites in (a) are both shown bound to one A 2 B 
protomer of Bcgl: an interaction between the protomers is suggested 
by the double-headed arrow (in red). The B subunit (yellow oval) is 
responsible for DNA specificity and is positioned spanning both 
segments of the recognition sequence. The A subunits carry both 
REase and MTase activities and are illustrated as separate domains 
connected by a flexible linker. The MTase domains (green squares) 
overlay the sites of methylation in the specified segments of the recog- 
nition sequence. The REase domains (cyan triangles) are placed against 
the target bonds at the —10 and +10 positions, upstream and down- 
stream of the site in top and bottom strands, respectively: to mark their 
orientation on the DNA, the triangle points towards the 5'-end of each 
strand. At this stage, Bcgl has yet to engage the scissile bonds at the 
— 12 and +12 positions, (c) As (b) except that two additional A 2 B units 
from free solution now bridge the A 2 B units bound to each site via 
interactions between the nuclease domains of the free and the 
DNA-bound units. The two nuclease domains from one of the extra 
A 2 B units engage the scissile bonds at the —12 positions upstream of 
both recognition sites (on the left, as drawn in [c]) while those from the 
other occupy the downstream +12 positions in both sites (on the right). 
The scheme thus results in a dimer of nuclease domains, with two 
catalytic centres for phosphodiester hydrolysis at all four loci where 
Bcgl makes a DSB. 
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MTase subunit in a pseudo-symmetric fashion (6,57). 
BcgIB appears to be organized similarly (39). Following 
SDS-PAGE, it appeared from protein staining that Bcgl 
contained two mole A per mole B (K94), but gel stains 
are often far from quantitative (42). Nevertheless, native 
MS of the intact protein under conditions that minimize 
the disruption of non-covalent complexes (49) validated 
the earlier proposal and showed that Bcgl is indeed 
composed of A 2 B units (Figure 1). Under these condi- 
tions, however, we also find some lone A and B subunits 
and an AB assembly but no A 2 dimers. In all likelihood, 
the B subunit occupies the central position in the A 2 B 
assembly with A subunits symmetrically disposed on 
either side, both interacting with B but not with each 
other. This configuration is the same as that for the M 2 S 
structure of a Type I MTase (6,57) and a similar arrange- 
ment has been proposed for the Ahdl MTase, a system 
that has a Type II REase but a Type I-like MTase (58). 
Native MS also revealed the presence of species containing 
multiple copies of the A 2 B protomer, from an (A 2 B) 2 
dimer to at least up to a (A 2 B) 4 tetramer, albeit with pro- 
gressively decreasing peak heights at higher masses 
(Figure 1). Yet, while native MS can identify these 
larger species, peak heights do not provide quantitative 
information about the amount of each species present in 
the sample as it was injected from the nano-ESI, since the 
sensitivity of the ion detector declines with increasing m/z 
values. Nonetheless the spectra indicated no preference for 
one oligomer over another, as the intensity of the peaks 
from the aggregates declined steadily with size. 

AUC was therefore used to investigate the self- 
association of the Bcgl protein (Figures 2 and 3). At 
sub-micromolar concentrations, Bcgl existed in solution 
as a single ideal species with a MW matching that for an 
individual protomer (Figure 2a). But at micomolar con- 
centrations, of the order of those required for maximal 
nuclease activity (Figure 6c), multiple aggregates were 
present (Figure 2b). The M w values increased with 
increasing concentrations of Bcgl, to levels above that 
for a dimeric (A 2 B) 2 unit (Figure 3). As the M w values 
reflect all of the species in the cell, the mixtures must 
contain some of greater mass than the dimer. Bcgl thus 
readily associates with itself to give large assemblies con- 
taining several copies of the A 2 B protomer. Its complex 
with non-specific DNA also contained multiple protomers 
(Figure 4a). In contrast, the addition of Bcgl to a specific 
duplex gave an insoluble complex that precipitated imme- 
diately from solution, as judged by its removal from 
solution on centrifugation at low velocity (Figure 4b) 
and by its inability to enter a polyacrylamide gel on elec- 
trophoresis (Figure 7a). The binding of Bcgl to specific 
DNA thus generates highly aggregated states that 
possibly link together several DNA molecules, but these 
then slowly dissociate to release individual DNA-protein 
complexes back into solution. 

The cleavage of a plasmid with one Bcgl site required 
excess protein over DNA (Figure 5a), which raises the 
possibility that this REase may carry out just one 
turnover. It had been suggested that further cleavage re- 
actions were blocked by the excised 34-mer remaining 
bound to the enzyme while it was being methylated (12). 



Yet the fact that nuclease reactions on a plasmid with two 
sites proceed to completion when the enzyme is present at 
lower concentrations than sites on the DNA shows that 
the Bcgl REase is capable of multiple turnovers, with each 
enzyme molecule participating in more than one cleavage 
event. Bcgl is therefore not a suicide enzyme (59,60), one 
that is inactivated by its own reaction product. The need 
for excess enzyme for its reaction in trans on one-site 
DNA, despite its reactions in cis featuring multiple turn- 
overs, can be accounted for if the Bcgl proteins bound to 
both recognition sites have to recruit additional molecules 
of protein before they become active. In this situation, a 
reaction in trans will require two DNA molecules to be 
both loaded with enzyme at the recognition site and with 
the extra enzyme. But if the enzyme binds tightly to its 
cognate site as seems to be the case here (Figure 7c), fully 
loaded molecules of the one-site DNA carrying the extra 
enzyme will appear in solution only after adding a stoi- 
chiometric excess of enzyme over sites. Conversely, for a 
reaction in cis, the Bcgl molecules bound to the two rec- 
ognition sites in the same DNA molecule may interact 
with each other cooperatively (16), which could lead 
directly to the capture of the additional molecules of 
protein required for the cleavage reaction. 

When bound to its recognition site, the A 2 B protomer is 
in a position to place its two nuclease domains against two 
of the four target phosphodiester bonds at that site 
(Figure 9b). This would give a structure similar to that 
for M.Ahdl (58), an M 2 S MTase but which is thought 
from neutron scattering data to have arms extending 
from both M subunits. In the scheme shown in 
Figure 9b, the nuclease domains of the A subunits are 
placed against the —10 and +10 targets in the two 
strands, one upstream and one downstream of the site, 
but an equally plausible model would put these domains 
against the —12 and +12 bonds. On the other hand, it is 
improbable that an A 2 B unit places one nuclease centre 
against a bond 10 nt distant from the site and the other 
12 nt away, as this would break the intrinsic symmetry of 
the system. Moreover, the positioning of one nuclease 
domain at a 10 position and the other at a 12 position 
calls for an implausible reconfiguration of one A subunit: 
an attack on both scissile bonds in the same strand (say 
— 10 and +12) requires a 180° rotation of the nuclease 
domain in one A subunit relative to the other, to keep 
register with strand polarity; alternatively, to attack both 
strands at one cleavage locus (say —10 and —12), the 
distance between the nuclease and MTase domains in 
one A subunit would have to be massively different 
from that in the other. The synaptic complex of two 
A 2 B protomers and two recognition sites (Figure 9b) 
would thus seem to be incapable of making a DSB at 
any single cleavage locus. Indeed, it is probably incapable 
of cutting even one bond since no nicked species appear 
during DNA cleavage by Bcgl (Figure 8: 41). 

The monomer of the Fokl REase binds to its asymmet- 
ric site via its DNA recognition domain (27) and then 
moves its catalytic domain to its target bond, 1 3 nt away 
in the bottom strand, but the REase remains inactive, not 
even nicking the DNA (32). It becomes active only after a 
second catalytic centre engages the target bond in the top 
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strand. The second centre can come from another 
monomer of the Fokl protein, either in free solution or 
bound elsewhere to the same DNA (31), or from just its 
catalytic domain (30). A similar scheme may also apply to 
the Bcgl REase (Figure 9c). Much higher concentrations of 
Bcgl protein are required to cleave DNA (Figure 6c) than 
to bind to specific DNA (Figure 7c) so the complex formed 
at the cognate site must remain inactive until it recruits 
additional molecules of the protein from free solution. 
Bcgl bound to its site can also be activated by the A 
subunit alone (Figure 8). The function of either the add- 
itional A 2 B protein, or BcgIA alone, is presumably to as- 
sociate transiently with the A subunits of the DNA-bound 
enzyme to position two catalytic centres for phosphodiester 
hydrolysis at each cleavage locus: the two centres can then 
cut both strands at that locus. Moreover, as the activation 
of the Bcgl REase by the A protein alone leads directly to 
the excision of the 34-mer (Figure 8), both A subunits in the 
A 2 B protomer on the DNA must interact with free A 
protein before any reaction. 

Figure 9 shows a proposal for the juxtaposition of eight 
catalytic centres against the eight target bonds in the Bcgl 
substrate. In this scheme, one A 2 B protomer binds to each 
recognition site and positions its two centres for nuclease 
activity, one in each A subunit, against one particular set 
of phosphodiester bonds, illustrated here (Figure 9b) as 
the —10 and +10 targets. Two additional A 2 B protomers 
from free solution then become incorporated into the 
synaptic complex through interactions between their 
nuclease domains and the nuclease domains of the 
DNA-bound proteins (Figure 9c). The additional 
protomers bridge the two DNA sites, so that (in the con- 
figuration drawn) one of the extra protomers engages the 
— 12 targets upstream of both sites and the other the +12 
targets downstream of both sites. Alternative configur- 
ations for the bridging protomers are also possible: for 
example, they could span "diagonally" the upstream 
locus at one site and the downstream locus at the other 
site. Indeed, the insolubility of the specific DNA-protein 
complex for Bcgl seen in the AUC (Figure 4b) could be 
due to the two bridging protomers spanning not the same 
two DNA-bound protomers to form the closed structure 
shown in Figure 9c but rather an open-ended structure 
with one DNA-bound protomer being linked via 
bridging units to two different DNA-protein complexes. 
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